Video SMS Message Sending and Receiving Methods and Apparatuses Thereof, and Handheld Electronic Device

ABSTRACT

Embodiments of the present invention provide video SMS message sending and receiving methods and apparatuses thereof, and a handheld electronic device. The video SMS message sending method of the present invention includes: acquiring audio/video content, synchronously acquiring graffiti input corresponding to the audio/video content, and combining the audio/video content and the graffiti input into a graffiti video file; sending the graffiti video file as a video SMS message to an SMS message receiver. According to the embodiments of the present invention, a key point in a video SMS message can be emphasized, optimizing a video SMS message display effect

This application is a continuation of International Application No.PCT/CN2014/070922, filed on Jan. 20, 2014, which claims priority toChinese Patent Application No. 201310033292.4, filed on Jan. 29, 2013,both of which are hereby incorporated by reference in their entireties.

TECHNICAL FIELD

Embodiments of the present invention relate to communicationstechnologies, and in particular, to video SMS message sending andreceiving methods and apparatuses thereof, and a handheld electronicdevice.

BACKGROUND

With development of computer technologies and communicationstechnologies, and in particular, in an era when smartphones developrapidly, a requirement of a user for a mobile phone is no longer simplya call function and an SMS message function; instead, it is expectedthat a mobile phone can provide more various multimedia applicationservices. In addition, an operator also expects to earn more profits byusing a multimedia value-added service. Therefore, emergence of amultimedia message, a picture SMS message, and a video SMS message justmeets a market demand.

FIG. 1 is a schematic diagram of an implementation solution of a videotechnology in the prior art, where the solution is a currently popularone-to-one video SMS message implementation solution. As shown in FIG.1, for party A, mobile phone A collects an audio/video by using twoaudio/video collecting devices: a camera and a microphone that are onthe mobile phone, compresses the audio/video after collecting iscomplete to reduce transmission traffic, and then sends a compressedaudio/video to public network signaling and a data server. The dataserver first finds that a destination of the video SMS message is partyB, and that party B is online, and then sends the video SMS message toparty B. After completing receiving the video SMS message, party B firstdecompresses, then decodes, and plays the received video SMS message.

In the foregoing technology, collected content of a video needs to beelaborated by a sender by adding speech or text; a recipient needs tolisten to sound in the video to determine a key point that a userintends to share, so that a highlight in the video cannot be emphasized.

SUMMARY

Embodiments of the present invention provide video SMS message sendingand receiving methods and apparatuses thereof, and a handheld electronicdevice, to emphasize a key point in a video SMS message, therebyoptimizing a video SMS message display effect.

According to a first aspect, an embodiment of the present inventionprovides a video SMS message sending method, including acquiringaudio/video content, synchronously acquiring graffiti inputcorresponding to the audio/video content, and combining the audio/videocontent and the graffiti input into a graffiti video file; and sendingthe graffiti video file as a video SMS message to an SMS messagereceiver.

In a first possible implementation manner of the first aspect, theacquiring audio/video content, synchronously acquiring graffiti inputcorresponding to the audio/video content, and combining the audio/videocontent and the graffiti input into a graffiti video file includes:acquiring the audio/video content to form an audio/video file, andsynchronously acquiring the graffiti input corresponding to theaudio/video content to form a graffiti file; and separately storing theaudio/video file and the graffiti file in the graffiti video file, andrecording respective starting locations of the audio/video file and thegraffiti file.

According to the first possible implementation manner of the firstaspect, in a second possible implementation manner, the acquiring theaudio/video content to form an audio/video file, and synchronouslyacquiring the graffiti input corresponding to the audio/video content toform a graffiti file includes: acquiring audio/video content of acurrent frame, and recording the audio/video content in the audio/videofile. While acquiring the audio/video content of the current frame,collecting, at a transparent background layer, at least one type ofgraffiti input, combining the at least one type of graffiti input into agraffiti picture of the current frame, recording the graffiti picture ofthe current frame in the graffiti file, and recording attributeinformation of the graffiti picture of the current frame in a graffitiinformation maintenance table; and repeatedly executing the foregoingsteps to process each frame, until it is determined that recording ofthe audio/video content ends.

According to the second possible implementation manner of the firstaspect, in a third possible implementation manner, the attributeinformation of the graffiti picture of the current frame includes atleast: a size of the graffiti picture of the current frame, a time pointand an encoding format that are of the current frame, and duration ofthe current frame.

According to the first possible implementation manner of the firstaspect, in a fourth possible implementation manner, the acquiring theaudio/video content to form an audio/video file, and synchronouslyacquiring the graffiti input corresponding to the audio/video content toform a graffiti file includes: acquiring audio/video content of acurrent frame, and recording the audio/video content in the audio/videofile. While acquiring the audio/video content of the current frame,collecting, at a transparent background layer, at least one type ofgraffiti input, separately recording the at least one type of graffitiinput in the graffiti file, and separately recording attributeinformation of the at least one type of graffiti input in a graffitiinformation maintenance table. The method further includes repeatedlyexecuting the foregoing steps to process each frame, until it isdetermined that recording of the audio/video content ends.

In a fifth possible implementation manner of the first aspect, theacquiring audio/video content, synchronously acquiring graffiti inputcorresponding to the audio/video content, and combining the audio/videocontent and the graffiti input into a graffiti video file includes:acquiring audio/video content of a current frame, and storing theaudio/video content in an audio/video frame cache. While acquiring theaudio/video content of the current frame, the method includescollecting, at a transparent background layer, at least one type ofgraffiti input, and separately storing the at least one type of graffitiinput in the at least one type of graffiti frame cache. The methodfurther includes combining data that is in the audio/video frame cacheand the at least one type of graffiti frame cache and that is of thecurrent frame, and storing combined data in a combined-frame cache. Themethod further includes encoding data that is in the combined-framecache to form an audio/video of the current frame, and recording theaudio/video in the graffiti video file; and repeatedly executing theforegoing steps to process each frame, until it is determined thatrecording of the audio/video content ends.

According to any one of the second to the fifth possible implementationmanners of the first aspect, in a sixth possible implementation manner,the at least one type of graffiti input includes at least: touch trackinput and an additional picture.

According to the sixth possible implementation manner of the firstaspect, in a seventh possible implementation manner, attributeinformation of the touch track input includes: a coordinate, a linewidth, color, a time point, and duration that are of a touch track.

According to a second aspect, an embodiment of the present inventionprovides a video SMS message receiving method, including: receiving avideo SMS message sent by an SMS message sender; and acquiring, byparsing, audio/video content and graffiti input from the video SMSmessage, and superimposing the graffiti input on the audio/video contentto perform playing.

In a first possible implementation manner of the second aspect, theacquiring, by parsing, audio/video content and graffiti input from thevideo SMS message, and superimposing the graffiti input on theaudio/video content to perform playing includes: separately acquiring,at respective starting locations that are of an audio/video file and agraffiti file and indicated in a graffiti video file of the video SMSmessage, the audio/video file and the graffiti file by parsing;acquiring, by splitting, an audio layer and a video layer from theaudio/video file; acquiring a graffiti layer from the graffiti file;synchronizing the graffiti layer with the audio layer and the videolayer in terms of frame time according to attribute information, of agraffiti picture, recorded in a graffiti information maintenance tablein the graffiti file. The method further includes playing the audiolayer by using an audio channel, playing the video layer by using asecond layer of a video channel, playing the graffiti layer by using afirst layer of the video channel, where the first layer is superimposedon the second layer.

In a second possible implementation manner of the second aspect, theacquiring, by parsing, audio/video content and graffiti input from thevideo SMS message, and superimposing the graffiti input on theaudio/video content to perform playing includes separately acquiring, atrespective starting locations that are of an audio/video file and agraffiti file and indicated in a graffiti video file of the video SMSmessage, the audio/video file and the graffiti file by parsing. Themethod further includes acquiring, by splitting, an audio layer and avideo layer from the audio/video file; acquiring a graffiti layer fromthe graffiti file; synchronizing the at least one type of graffiti inputthat is at the graffiti layer with the audio layer and the video layerin terms of frame time according to attribute information, of the atleast one type of graffiti input, recorded in a graffiti informationmaintenance table in the graffiti file. The method further includesplaying the audio layer by using an audio channel, playing the videolayer by using a second layer of a video channel, playing the at leastone type of graffiti input that is at the graffiti layer by using afirst layer of the video channel separately, where the first layer issuperimposed on the second layer.

According to the second possible implementation manner of the secondaspect, in a third possible implementation manner, the at least one typeof graffiti input includes at least: touch track input and an additionalpicture.

According to a third aspect, an embodiment of the present inventionprovides a video SMS message sending apparatus, including an acquiringand combining module configured to acquire audio/video content,synchronously acquire graffiti input corresponding to the audio/videocontent, and combine the audio/video content and the graffiti input intoa graffiti video file; and a sending module configured to send thegraffiti video file as a video SMS message to an SMS message receiver.

In a first possible implementation manner of the third aspect, theacquiring and combining module includes a file acquiring unit,configured to acquire the audio/video content to form an audio/videofile, and synchronously acquire the graffiti input corresponding to theaudio/video content to form a graffiti file; and a file combining unit,configured to separately store the audio/video file and the graffitifile in the graffiti video file, and record respective startinglocations of the audio/video file and the graffiti file.

According to the first possible implementation manner of the thirdaspect, in a second possible implementation manner, the file acquiringunit includes an audio/video file acquiring subunit, configured toacquire audio/video content of a current frame, and record theaudio/video content in the audio/video file. A first graffiti fileacquiring subunit is configured to: while the audio/video content of thecurrent frame is being acquired, collect, at a transparent backgroundlayer, at least one type of graffiti input, combine the at least onetype of graffiti input into a graffiti picture of the current frame,record the graffiti picture of the current frame in the graffiti file,and record attribute information of the graffiti picture of the currentframe in a graffiti information maintenance table. Operations of theaudio/video file acquiring subunit and the first graffiti file acquiringsubunit are repeatedly executed to process each frame, until it isdetermined that recording of the audio/video content ends.

According to the second possible implementation manner of the thirdaspect, in a third possible implementation manner, the attributeinformation of the graffiti picture of the current frame includes atleast: a size of the graffiti picture of the current frame, a time pointand an encoding format that are of the current frame, and duration ofthe current frame.

According to the first possible implementation manner of the thirdaspect, in a fourth possible implementation manner, the file acquiringunit includes an audio/video file acquiring subunit configured toacquire audio/video content of a current frame, and record theaudio/video content in the audio/video file. A second graffiti fileacquiring subunit is configured to: while the audio/video content of thecurrent frame is being acquired, collect, at a transparent backgroundlayer, at least one type of graffiti input, separately record the atleast one type of graffiti input in the graffiti file, and separatelyrecord attribute information of the at least one type of graffiti inputin a graffiti information maintenance table. Operations of theaudio/video file acquiring subunit and the second graffiti fileacquiring subunit are repeatedly executed to process each frame, untilit is determined that recording of the audio/video content ends.

In a fifth possible implementation manner of the third aspect, theacquiring and combining module includes an audio/video cache unitconfigured to acquire audio/video content of a current frame, and storethe audio/video content in an audio/video frame cache. A graffiti cacheunit is configured to: while the audio/video content of the currentframe is being acquired, collect, at a transparent background layer, atleast one type of graffiti input, and separately store the at least onetype of graffiti input in the at least one type of graffiti frame cache.A cache combining unit is configured to combine data that is in theaudio/video frame cache and the at least one type of graffiti framecache and that is of the current frame, and store combined data in acombined-frame cache. A cache encoding unit is configured to encode datathat is in the combined-frame cache to form an audio/video of thecurrent frame, and record the audio/video in the graffiti video file.Operations of the audio/video cache unit, the graffiti cache unit, thecache combining unit, and the cache encoding unit are repeatedlyexecuted to process each frame, until it is determined that recording ofthe audio/video content ends.

According to any one of the second to the fifth possible implementationmanners of the third aspect, in a sixth possible implementation manner,the at least one type of graffiti input includes at least: touch trackinput and an additional picture.

According to the sixth possible implementation manner of the thirdaspect, in a seventh possible implementation manner, attributeinformation of the touch track input includes: a coordinate, a linewidth, color, a time point, and duration that are of a touch track.

According to a fourth aspect, an embodiment of the present inventionprovides a video SMS message receiving apparatus, including a receivingmodule configured to receive a video SMS message sent by an SMS messagesender. A parsing and playing module configured to acquire, by parsing,audio/video content and graffiti input from the video SMS message, andsuperimpose the graffiti input on the audio/video content to performplaying.

In a first possible implementation manner of the fourth aspect, theparsing and playing module includes a file parsing unit configured toseparately acquire, at respective starting locations that are of anaudio/video file and a graffiti file and indicated in a graffiti videofile of the video SMS message, the audio/video file and the graffitifile by parsing. An audio/video splitting unit is configured to acquire,by splitting, an audio layer and a video layer from the audio/videofile. A graffiti splitting unit is configured to acquire a graffitilayer from the graffiti file. A first time synchronization unit isconfigured to synchronize the graffiti layer with the audio layer andthe video layer in terms of frame time according to attributeinformation, of a graffiti picture, recorded in a graffiti informationmaintenance table in the graffiti file. A first playing unit isconfigured to play the audio layer by using an audio channel, play thevideo layer by using a second layer of a video channel, and play thegraffiti layer by using a first layer of the video channel, where thefirst layer is superimposed on the second layer.

In a second possible implementation manner of the fourth aspect, theparsing and playing module includes a file parsing unit configured toseparately acquire, at respective starting locations that are of anaudio/video file and a graffiti file and indicated in a graffiti videofile of the video SMS message, the audio/video file and the graffitifile by parsing. A n audio/video splitting unit is configured toacquire, by splitting, an audio layer and a video layer from theaudio/video file. A graffiti splitting unit is configured to acquire agraffiti layer from the graffiti file. A second time synchronizationunit is configured to synchronize the at least one type of graffitiinput that is at the graffiti layer with the audio layer and the videolayer in terms of frame time according to attribute information, of theat least one type of graffiti input, recorded in a graffiti informationmaintenance table in the graffiti file. A second playing unit isconfigured to play the audio layer by using an audio channel, play thevideo layer by using a second layer of a video channel, and play the atleast one type of graffiti input that is at the graffiti layer by usinga first layer of the video channel separately, where the first layer issuperimposed on the second layer.

According to the second possible implementation manner of the fourthaspect, in a third possible implementation manner, the at least one typeof graffiti input includes at least: touch track input and an additionalpicture.

According to a fifth aspect, an embodiment of the present inventionprovides a handheld electronic device including a memory, and furtherincluding the video SMS message sending apparatus according to anyembodiment of the present invention and the video SMS message receivingapparatus according to any embodiment of the present invention. Thehandheld electronic device further includes a camera configured tocollect video content; a microphone, configured to collect audiocontent; a touchscreen, configured to receive an input operation of auser; an audio channel, configured to play the audio content; and avideo channel, configured to play the video content and graffiti input.

According to video SMS message sending and receiving methods andapparatuses thereof, and a handheld electronic device that are providedby embodiments of the present invention, a highlight in a video isindicated by graffiti in a video SMS message shooting process, toemphasize a key point that is in the video SMS message, therebyoptimizing a video SMS message display effect.

BRIEF DESCRIPTION OF THE DRAWINGS

To describe the technical solutions in the embodiments of the presentinvention or in the prior art more clearly, the following brieflyintroduces the accompanying drawings required for describing theembodiments or the prior art. Apparently, the accompanying drawings inthe following description show some embodiments of the presentinvention, and persons of ordinary skill in the art may still deriveother drawings from these accompanying drawings without creativeefforts.

FIG. 1 is a schematic diagram of an implementation solution of a videoSMS message in the prior art;

FIG. 2 is a flowchart of a video SMS message sending method according toEmbodiment 1 of the present invention;

FIG. 3 is a flowchart of a video SMS message sending method according toEmbodiment 2 of the present invention;

FIG. 4 is a flowchart of a video SMS message sending method according toEmbodiment 3 of the present invention;

FIG. 5 is a flowchart of a video SMS message sending method according toEmbodiment 4 of the present invention;

FIG. 6 is a flowchart of a video SMS message receiving method accordingto Embodiment 5 of the present invention;

FIG. 7 is a flowchart of a video SMS message receiving method accordingto Embodiment 6 of the present invention;

FIG. 8 is a flowchart of a video SMS message receiving method accordingto Embodiment 7 of the present invention;

FIG. 9 is a schematic structural diagram of a video SMS message sendingapparatus according to Embodiment 8 of the present invention;

FIG. 10 is a schematic structural diagram of a video SMS message sendingapparatus according to Embodiment 9 of the present invention;

FIG. 11 is a schematic structural diagram of a video SMS message sendingapparatus according to Embodiment 10 of the present invention;

FIG. 12 is a schematic structural diagram of a video SMS messagereceiving apparatus according to Embodiment 11 of the present invention;

FIG. 13 is a schematic structural diagram of a video SMS messagereceiving apparatus according to Embodiment 12 of the present invention;and

FIG. 14 is a schematic structural diagram of a handheld electronicdevice according to Embodiment 13 of the present invention.

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

To make the objectives, technical solutions, and advantages of theembodiments of the present invention clearer, the following clearly anddescribes the technical solutions in the embodiments of the presentinvention with reference to the accompanying drawings in the embodimentsof the present invention. Apparently, the described embodiments are somebut not all of the embodiments of the present invention. All otherembodiments obtained by persons of ordinary skill in the art based onthe embodiments of the present invention without creative efforts shallfall within the protection scope of the present invention.

Embodiment 1

FIG. 2 is a flowchart of a video SMS message sending method according toEmbodiment 1 of the present invention. The embodiment may be applied toa video SMS message where a handheld electronic device edits and addsgraffiti input, to emphasize a highlight that is in a video and that auser intends to share. The method may be executed by a video SMS messagesending apparatus. The apparatus may be implemented in a form ofhardware and/or software, and may be integrated in a handheld electronicdevice. The handheld electronic device may be any terminal device suchas a mobile phone and a tablet computer. The method specificallyincludes the following steps.

Step 201: Acquire audio/video content, synchronously acquire graffitiinput corresponding to the audio/video content, and combine theaudio/video content and the graffiti input into a graffiti video file.

Specifically, the audio/video content may be audio/video contentcollected by using two audio/video collecting devices: a camera and amicrophone that are in the handheld electronic device. The graffitiinput may be touch track input and/or additional picture informationthat are/is collected by using a touchscreen of the handheld electronicdevice. Combining in the step only needs to simply join, end to end, avideo file formed by the audio/video content and a graffiti file formedby the graffiti input, and add a file header to the graffiti video fileto record locations of the two files after superimposition, so as tofacilitate separation of the two files when file parsing is performed.

Step 202: Send the graffiti video file as a video SMS message to an SMSmessage receiver.

Specifically, the SMS message receiver is just relative to an SMSmessage sender, both of which are equal. In a communication process, theSMS message receiver and the SMS message sender may simultaneously senda video SMS message, or either one may send a video SMS message.

In the embodiment, in a video SMS message acquiring process, a highlightin a video is indicated by using graffiti input, so that a largequantity of words used to express content of a video SMS message in theprior art are changed into a graffiti manner, optimizing a video SMSmessage display effect, and also improving user experience andincreasing fun of video shooting.

Based on the foregoing embodiment, the operation of acquiringaudio/video content, synchronously acquiring graffiti inputcorresponding to the audio/video content, and combining the audio/videocontent and the graffiti input into a graffiti video file may bespecifically: acquiring the audio/video content to form an audio/videofile, and synchronously acquiring the graffiti input corresponding tothe audio/video content to form a graffiti file; separately storing theaudio/video file and the graffiti file in the graffiti video file, andrecording respective starting locations of the audio/video file and thegraffiti file. According to the technical solution, an independentaudio/video file and an independent graffiti file may be separatelyformed, and separately recorded for a receiver to perform acquiring byparsing, so that a user can perform an operation only on the graffitifile or the audio/video file, for example, after recording is complete,the user may further modify the graffiti file.

Embodiment 2

FIG. 3 is a flowchart of a video SMS message sending method according toEmbodiment 2 of the present invention. Based on the foregoingembodiment, the embodiment further optimizes the operation of acquiringthe audio/video content to form an audio/video file, and synchronouslyacquiring the graffiti input corresponding to the audio/video content toform a graffiti file. As shown in FIG. 3, the operation of acquiring theaudio/video content to form an audio/video file, and synchronouslyacquiring the graffiti input corresponding to the audio/video content toform a graffiti file may specifically include the following steps.

Step 301: Acquire audio/video content of a current frame, and record theaudio/video content in the audio/video file.

Step 302: While acquiring the audio/video content of the current frame,collect, at a transparent background layer, at least one type ofgraffiti input, combine the at least one type of graffiti input into agraffiti picture of the current frame, record the graffiti picture ofthe current frame in the graffiti file, and record attribute informationof the graffiti picture of the current frame in a graffiti informationmaintenance table.

The attribute information of the graffiti picture of the current frameincludes at least: a size of the graffiti picture of the current frame,a time point and an encoding format that are of the current frame, andduration of the current frame. The attribute information is recorded, sothat the graffiti picture may be restored by acquiring the attributeinformation by parsing at a receiver. The at least one type of graffitiinput may include at least: touch track input and an additional picture,and for example, may further include a transparent background picture.

Step 303: Repeatedly execute step 301 and step 302 to process eachframe, until it is determined that recording of the audio/video contentends.

The foregoing step 301 and step 302 are processing on each frame. Aprocessing operation on each frame is the same and is repeatedlyexecuted, and each frame is recorded in a file one by one, until therecording ends.

In the embodiment of the present invention, a graffiti picture of eachframe is synchronously formed, so as to reduce subsequent processingtime, for example, time of parsing and synchronizing after a receiverreceives a video SMS message is reduced; attribute information of agraffiti picture of a current frame is recorded in a graffitiinformation maintenance table of a graffiti file, so as to facilitateparsing when playing is performed, and when the graffiti picture of thecurrent frame is changed, the graffiti information maintenance table maybe updated in a timely manner, for example, when an operation, such aserasing touch track input or deleting an additional picture, isperformed, modification information is updated in real time in a partthat is corresponding to the current frame and that is in the graffitiinformation maintenance table, so that a user can perform an operationonly on the graffiti file or an audio/video file.

Embodiment 3

Based on the foregoing descriptions, as shown in FIG. 4, anotherimplementation manner of acquiring the audio/video content to form anaudio/video file, and synchronously acquiring the graffiti inputcorresponding to the audio/video content to form a graffiti file asdescribed below.

Step 401: Acquire audio/video content of a current frame, and record theaudio/video content in the audio/video file.

Step 402: While acquiring the audio/video content of the current frame,collect, at a transparent background layer, at least one type ofgraffiti input, separately record the at least one type of graffitiinput in the graffiti file, and separately record attribute informationof the at least one type of graffiti input in a graffiti informationmaintenance table.

The at least one type of graffiti input includes at least: touch trackinput and an additional picture. Attribute information of the touchtrack input includes: a coordinate, a line width, color, a time point,and duration that are of a touch track. The attribute information isrecorded, so that the at least one type of graffiti input may berestored by acquiring the attribute information by parsing at areceiver.

Step 403: Repeatedly execute step 401 and step 402 to process eachframe, until it is determined that recording of the audio/video contentends.

The foregoing step 401 and step 402 are processing on each frame. Aprocessing operation on each frame is the same and is repeatedlyexecuted, and each frame is recorded in a file one by one, until therecording of the audio/video content ends.

A difference between the embodiment of the present invention and theforegoing Embodiment 2 lies in that: in the embodiment of the presentinvention, the at least one type of graffiti input is not combined intoa graffiti picture of a current frame; instead, the at least one type ofgraffiti input is separately recorded in a graffiti file, and attributeinformation of the at least one type of graffiti input is separatelyrecorded in a graffiti information maintenance table, so that when onlyone type of graffiti input needs to be changed, only attributeinformation corresponding to the type of graffiti input is updated,thereby improving efficiency. For example, when touch track input needsto be erased or attribute information of the touch track input such as aline width and color needs to be changed, only the attribute informationthat is of the touch track input and that is in the graffiti informationmaintenance table is updated; when an additional picture needs to bedeleted or added, only attribute information that is of the additionalpicture and that is in the graffiti information maintenance table isupdated, so that a user can conveniently perform an operation only onone type of graffiti input.

Embodiment 4

FIG. 5 is a flowchart of a video SMS message sending method according toEmbodiment 4 of the present invention. Based on the foregoingembodiment, the embodiment further optimizes the operation of acquiringaudio/video content, synchronously acquiring graffiti inputcorresponding to the audio/video content, and combining the audio/videocontent and the graffiti input into a graffiti video file. As shown inFIG. 5, the acquiring audio/video content, synchronously acquiringgraffiti input corresponding to the audio/video content, and combiningthe audio/video content and the graffiti input into a graffiti videofile may specifically include the following steps.

Step 501: Acquire audio/video content of a current frame, and store theaudio/video content in an audio/video frame cache.

Specifically, the audio/video content may be collected by using a cameraand a microphone. During echoing of a video, audio/video data may bestored in the audio/video frame cache, and the audio/video frame cacheis a hardware resource and exists in all mobile terminal devices.

Step 502: While acquiring the audio/video content of the current frame,collect, at a transparent background layer, at least one type ofgraffiti input, and separately store the at least one type of graffitiinput in the at least one type of graffiti frame cache.

Specifically, the at least one type of graffiti input includes at least:touch track input and an additional picture, and for example, furtherincludes a transparent background picture. During echoing of the atleast one type of graffiti input, graffiti data is stored in the atleast one type of graffiti frame cache. For example, during echoing ofthe touch track input, graphic data formed by touching is stored in atouch display frame cache, while the additional picture and the like arestored in a user interface display frame cache.

Step 503: Combine data that is in the audio/video frame cache and the atleast one type of graffiti frame cache and that is of the current frame,and store combined data in a combined-frame cache.

Specifically, layer data that is in the audio/video frame cache and ofthe current frame and layer data that is in the at least one type ofgraffiti frame cache and of the current frame are combined into data ofa main layer by using a layer (or an audio) mixer. Data aftercombination is final data sent to a display device. For example, data ofeach layer is in an ARGB32 format. Because a main layer is displayedfinally, and the main layer is formed by combining multiple layers, amanner of the combining specifically refers to that a piece ofcorresponding control data describes data of a layer corresponding toeach area, and that a mixer performs AND/OR combination on the data ofeach layer according to the information.

Step 504: Encode data that is in the combined-frame cache to form anaudio/video of the current frame, and record the audio/video in thegraffiti video file.

Specifically, the encoding uses a compression coding standard to reducetransmission traffic, for example, an H.264/MPEG4 standard is used.

Step 505: Repeatedly execute step 501 to step 504 to process each frame,until it is determined that recording of the audio/video content ends.

The foregoing step 501 to step 504 are processing on each frame. Aprocessing operation on each frame is the same and is repeatedlyexecuted, and each frame is recorded in the graffiti video file one byone, until the recording of the audio/video content ends.

According to the embodiment of the present invention, audio/videocontent of a current frame and the at least one type of graffiti inputof the current frame are combined by using a hardware mixer, and whenplaying and parsing are performed, there is no need to separately parseand process the audio/video content and the graffiti input, therebyreducing parsing time of a receiver, and improving playing efficiency.

Embodiment 5

FIG. 6 is a flowchart of a video SMS message receiving method accordingto Embodiment 5 of the present invention. As shown in FIG. 6, theembodiment may be applied to a video SMS message where a handheldelectronic device plays and adds graffiti input, to emphasize ahighlight that is in a video and that a user intends to share. Themethod may be executed by a video SMS message receiving apparatus. Theapparatus may be implemented form of hardware and/or software, and maybe integrated in a handheld electronic device. The handheld electronicdevice may be any terminal device such as a mobile phone and a tabletcomputer. The method specifically includes the following steps.

Step 601: Receive a video SMS message sent by an SMS message sender.

Specifically, the video SMS message is a video SMS message wheregraffiti input is added to audio/video content of a current frame.

Step 602: Acquire, by parsing, audio/video content and graffiti inputfrom the video SMS message, and superimpose the graffiti input on theaudio/video content to perform playing.

Specifically, the audio/video content is audio/video content collectedby using two audio/video collecting devices: a camera and a microphonethat are in a handheld electronic device. The graffiti input is touchtrack input and/or additional picture information that are/is collectedby using a touchscreen of the handheld electronic device.

According to the embodiment, in a video SMS message playing process, alarge quantity of words used to express content are changed into agraffiti manner, optimizing a video SMS message display effect, and alsoimproving user experience and increasing fun of video shooting.

Embodiment 6

FIG. 7 is a flowchart of a video SMS message receiving method accordingto Embodiment 6 of the present invention. Based on the foregoingembodiment, the embodiment further optimizes the operation of acquiring,by parsing, audio/video content and graffiti input from the video SMSmessage, and superimposing the graffiti input on the audio/video contentto perform playing. As shown in FIG. 7, the acquiring, by parsing,audio/video content and graffiti input from the video SMS message, andsuperimposing the graffiti input on the audio/video content to performplaying may include the following steps.

Step 701: Separately acquire, at respective starting locations that areof an audio/video file and a graffiti file and indicated in a graffitivideo file of the video SMS message, the audio/video file and thegraffiti file by parsing.

Specifically, the audio/video file and the graffiti file that are in thegraffiti video file are simply joined end to end. In addition, thegraffiti video file includes a file header that records the startinglocations of the two files after the superimposition, so as tofacilitate separation of the two files when file parsing is performed.

Step 702: Acquire, by splitting, an audio layer and a video layer fromthe audio/video file.

Specifically, the audio layer may be sound data collected by using amicrophone, and the video layer may be video data collected by using acamera.

Step 703: Acquire a graffiti layer from the graffiti file.

Specifically, the graffiti layer may be a graffiti picture formed byimage information, such as touch track input, an additional picture, ora transparent background picture, collected by using a touchscreen.

Step 704: Synchronize the graffiti layer with the audio layer and thevideo layer in terms of frame time according to attribute information,of a graffiti picture, recorded in a graffiti information maintenancetable in the graffiti file.

Specifically, in the graffiti information maintenance table, there ismainly attribute information of a graffiti picture of each frame, suchas a size, a time point, an encoding format, and duration. The graffitilayer is synchronized with the audio layer and the video layer in termsof frame time according to the time point and the duration that are ofthe graffiti picture.

Step 705: Play the audio layer by using an audio channel, play the videolayer by using a second layer of a video channel, play the graffitilayer by using a first layer of the video channel, where the first layeris superimposed on the second layer.

Specifically, because a graffiti file at the graffiti layer stores agraffiti picture that is after combining graffiti input, for example,information such as a transparent background picture, touch track input,and an additional picture, playing of the graffiti layer is the same asplaying of the video layer, that is, the graffiti layer can be directlysent to the video channel to be displayed. In addition, because thegraffiti input is added to video content of a current frame, thegraffiti layer is superimposed on the video layer to be played.

In the embodiment of the present invention, a graffiti picture of eachframe is synchronously combined with audio/video content of the currentframe, which can reduce frame synchronization time. Attributeinformation of a graffiti picture of a current frame is recorded in agraffiti information maintenance table of a graffiti file, so as tofacilitate parsing. In addition, according to the technical solution, anindependent audio/video file and an independent graffiti file may beacquired by parsing, and a user may perform, based on a graffiti video,a second modification or a second graffiti making, so that the user canedit the graffiti video any time.

Embodiment 7

FIG. 8 is a flowchart of a video SMS message receiving method accordingto Embodiment 7 of the present invention. As shown in FIG. 8, based onthe foregoing embodiment, according to the embodiment of the presentinvention, the acquiring, by parsing, audio/video content and graffitiinput from the video SMS message, and superimposing the graffiti inputon the audio/video content to perform playing may be further optimizedas follows.

Step 801: Separately acquire, at respective starting locations that areof an audio/video file and a graffiti file and indicated in a graffitivideo file of the video SMS message, the audio/video file and thegraffiti file by parsing.

Step 802: Acquire, by splitting, an audio layer and a video layer fromthe audio/video file.

Step 803: Acquire a graffiti layer from the graffiti file.

Specifically, the graffiti layer may be touch track input and/or anadditional picture that are/is collected by using a touchscreen.

Step 804: Synchronize the at least one type of graffiti input that is atthe graffiti layer with the audio layer and the video layer in terms offrame time according to attribute information, of the at least one typeof graffiti input, recorded in a graffiti information maintenance tablein the graffiti file.

Specifically, the at least one type of graffiti input includes at least:touch track input and an additional picture. Attribute information ofthe touch track input may be a coordinate, a line width, color, a timepoint, duration and the like that are of a touch track. Attributeinformation of the additional picture may be a size, a time point,duration and the like that are of the picture. The touch track input andthe additional picture may be synchronized with the audio layer and thevideo layer in terms of frame time according to a time point andduration that are corresponding to the touch track input and theadditional picture.

Step 805: Play the audio layer by using an audio channel, play the videolayer by using a second layer of a video channel, play the at least onetype of graffiti input that is at the graffiti layer by using a firstlayer of the video channel separately, where the first layer issuperimposed on the second layer.

Specifically, because the graffiti file at the graffiti layer stores theat least one type of graffiti input, for example, information such asthe touch track input and the additional picture, playing of thegraffiti layer and playing of the video layer are classified into atleast two cases: if it is the additional picture, the additional picturecan be directly sent to the first layer of the video channel to bedisplayed; if it is the touch track input, the touch track input isdrawn at the first layer of the video channel according to information,such as the coordinate, the line width, and the color, of the touchtrack. In addition, because the graffiti input is added to video contentof a current frame, the graffiti layer is superimposed on the videolayer to be played.

According to the embodiment of the present invention, attributeinformation of the at least one type of graffiti input is separatelyrecorded in a graffiti information maintenance table, the at least onetype of graffiti input that is independent may be acquired by parsing,so that a user can edit a graffiti video any time. In addition, when theuser separately performs a second operation on the at least one type ofgraffiti input in the graffiti video, for example, modification orgraffiti remaking, it only needs to separately update, based on anoriginal graffiti video, the at least one type of graffiti inputcorrespondingly, and it no longer needs to combine the at least one typeof graffiti input into a graffiti picture of a current frame to replacean original graffiti picture, so that the second operation is moreconvenient and faster.

Embodiment 8

FIG. 9 is a schematic structural diagram of a video SMS message sendingapparatus according to Embodiment 8 of the present invention. Theapparatus of the embodiment may be used to execute a technical solutionof a video SMS message sending method provided by an embodiment of thepresent invention. The apparatus of the embodiment specificallyincludes: an acquiring and combining module 91 and a sending module 92.The acquiring and combining module 91 is configured to acquireaudio/video content, synchronously acquire graffiti input correspondingto the audio/video content, and combine the audio/video content and thegraffiti input into a graffiti video file. The sending module 92 isconfigured to send the graffiti video file as a video SMS message to anSMS message receiver.

The apparatus of the embodiment may be used to execute a technicalsolution of a video SMS message sending method embodiment provided by anembodiment of the present invention. Implementation principles andtechnical effects thereof are similar, and are not described hereinagain.

Based on the foregoing descriptions, the acquiring and combining module91 preferably includes: a file acquiring unit 911 and a file combiningunit 912. The file acquiring unit 911 is configured to acquire theaudio/video content to form an audio/video file, and synchronouslyacquire the graffiti input corresponding to the audio/video content toform a graffiti file. The file combining unit 912 is configured toseparately store the audio/video file and the graffiti file in thegraffiti video file, and record respective starting locations of theaudio/video file and the graffiti file.

Based on the foregoing descriptions, the file acquiring unit 911preferably includes an audio/video file acquiring subunit 9111,configured to acquire audio/video content of a current frame, and recordthe audio/video content in the audio/video file. A first graffiti fileacquiring subunit 9112 is configured to: while the audio/video contentof the current frame is being acquired, collect, at a transparentbackground layer, at least one type of graffiti input, combine the atleast one type of graffiti input into a graffiti picture of the currentframe, record the graffiti picture of the current frame in the graffitifile, and record attribute information of the graffiti picture of thecurrent frame in a graffiti information maintenance table.

Operations of the audio/video file acquiring subunit 9111 and the firstgraffiti file acquiring subunit 9112 are repeatedly executed to processeach frame, until it is determined that recording of the audio/videocontent ends.

Specifically, the attribute information of the graffiti picture of thecurrent frame includes at least: a size of the graffiti picture of thecurrent frame, a time point and an encoding format that are of thecurrent frame, and duration of the current frame. The at least one typeof graffiti input includes at least: touch track input and an additionalpicture, and may further include a transparent background picture andthe like. Attribute information of the touch track input includes: acoordinate, a line width, color, a time point, and duration that are ofa touch track.

Embodiment 9

Based on the foregoing descriptions, as shown in FIG. 10, a fileacquiring unit 101 may include: an audio/video file acquiring subunit1011 and a second graffiti file acquiring subunit 1012. The audio/videofile acquiring subunit 1011 is configured to acquire audio/video contentof a current frame, and record the audio/video content in an audio/videofile. The second graffiti file acquiring subunit 1012 is configured to:while the audio/video content of the current frame is being acquired,collect, at a transparent background layer, at least one type ofgraffiti input, separately record the at least one type of graffitiinput in a graffiti file, and separately record attribute information ofthe at least one type of graffiti input in a graffiti informationmaintenance table. Operations of the audio/video file acquiring subunit1011 and the second graffiti file acquiring subunit 1012 are repeatedlyexecuted to process each frame, until it is determined that recording ofthe audio/video content ends. The at least one type of graffiti inputincludes at least: touch track input and an additional picture.Attribute information of the touch track input includes: a coordinate, aline width, color, a time point, and duration that are of a touch track.

The apparatus of the embodiment may be used to execute a technicalsolution of a video SMS message sending method embodiment provided by anembodiment of the present invention. Implementation principles andtechnical effects thereof are similar, and are not described hereinagain.

Embodiment 10

Referring to FIG. 11, based on the foregoing embodiment, an acquiringand combining module 11 may include: an audio/video cache unit 111, agraffiti cache unit 112, a cache combining unit 113, and a cacheencoding unit 114. The audio/video cache unit 111 is configured toacquire audio/video content of a current frame, and store theaudio/video content in an audio/video frame cache. The graffiti cacheunit 112 is configured to: while the audio/video content of the currentframe is being acquired, collect, at a transparent background layer, atleast one type of graffiti input, and separately store the at least onetype of graffiti input in the at least one type of graffiti frame cache.The cache combining unit 113 is configured to combine data that is inthe audio/video frame cache and the at least one type of graffiti framecache and that is of the current frame, and store combined data in acombined-frame cache. The cache encoding unit 114 is configured toencode data that is in the combined-frame cache to from an audio/videoof the current frame, and record the audio/video in a graffiti videofile. Operations of the audio/video cache unit 111, the graffiti cacheunit 112, the cache combining unit 113, and the cache encoding unit 114are repeatedly executed to process each frame, until it is determinedthat recording of the audio/video content ends. The at least one type ofgraffiti input includes at least: touch track input and an additionalpicture.

The apparatus of the embodiment may be used to execute a technicalsolution of a video SMS message sending method embodiment provided by anembodiment of the present invention. Implementation principles andtechnical effects thereof are similar, and are not described hereinagain.

Embodiment 11

FIG. 12 is a schematic structural diagram of a video SMS messagereceiving apparatus according to Embodiment 11 of the present invention.The apparatus of the embodiment may be used to execute a technicalsolution of a video SMS message receiving method provided by Embodiment5 of the present invention. The apparatus of the embodiment specificallyincludes: a receiving module 12 and a parsing and playing module 13. Thereceiving module 12 is configured to receive a video SMS message sent byan SMS message sender. The parsing and playing module 13 is configuredto acquire, by parsing, audio/video content and graffiti input from thevideo SMS message, and superimpose the graffiti input on the audio/videocontent to perform playing.

The apparatus of the embodiment may be used to execute a technicalsolution of a video SMS message receiving method embodiment provided byan embodiment of the present invention. Implementation principles andtechnical effects thereof are similar, and are not described hereinagain.

Based on the foregoing descriptions, the parsing and playing module 13preferably includes: a file parsing unit 131, an audio/video splittingunit 132, a graffiti splitting unit 133, a first time synchronizationunit 134, and a first playing unit 135. The file parsing unit 131 isconfigured to separately acquire, at respective starting locations thatare of an audio/video file and a graffiti file and indicated in agraffiti video file of the video SMS message, the audio/video file andthe graffiti file by parsing. The audio/video splitting unit 132 isconfigured to acquire, by splitting, an audio layer and a video layerfrom the audio/video file. The graffiti splitting unit 133 is configuredto acquire a graffiti layer from the graffiti file. The first timesynchronization unit 134 is configured to synchronize the graffiti layerwith the audio layer and the video layer in terms of frame timeaccording to attribute information, of a graffiti picture, recorded in agraffiti information maintenance table in the graffiti file. The firstplaying unit 135 is configured to play the audio layer by using an audiochannel, play the video layer by using a second layer of a videochannel, and play the graffiti layer by using a first layer of the videochannel, where the first layer is superimposed on the second layer. Theat least one type of graffiti input includes at least: touch track inputand an additional picture.

Embodiment 12

Based on the foregoing descriptions, as shown in FIG. 13, a parsing andplaying module 14 may further preferably include: a file parsing unit141, an audio/video splitting unit 142, a graffiti splitting unit 143, asecond time synchronization unit 144, and a second playing unit 145. Thefile parsing unit 141 is configured to separately acquire, at respectivestarting locations that are of an audio/video file and a graffiti fileand indicated in a graffiti video file of the video SMS message, theaudio/video file and the graffiti file by parsing. The audio/videosplitting unit 142 is configured to acquire, by splitting, an audiolayer and a video layer from the audio/video file. The graffitisplitting unit 143 is configured to acquire a graffiti layer from thegraffiti file. The second time synchronization unit 144 is configured tosynchronize the at least one type of graffiti input that is at thegraffiti layer with the audio layer and the video layer in terms offrame time according to attribute information, of the at least one typeof graffiti input, recorded in a graffiti information maintenance tablein the graffiti file. The second playing unit 145 is configured to playthe audio layer by using an audio channel, play the video layer by usinga second layer of a video channel, and play the at least one type ofgraffiti input that is at the graffiti layer by using a first layer ofthe video channel separately, where the first layer is superimposed onthe second layer. The at least one type of graffiti input includes atleast: touch track input and an additional picture.

Embodiment 13

FIG. 14 is a schematic structural diagram of a handheld electronicdevice according to Embodiment 13 of the present invention. The handheldelectronic device provided by the embodiment of the present inventionincludes a memory, a camera 100, a microphone 200, a touchscreen 300, avideo SMS message sending apparatus 400 provided by any embodiment ofthe present invention, a video SMS message receiving apparatus 500provided by any embodiment of the present invention, an audio channel600, and a video channel 700. The camera 100 is configured to collectvideo content. The microphone 200 is configured to collect audiocontent. The touchscreen 300 is configured to receive an input operationof a user. The audio channel 600 is configured to play the audiocontent. The video channel 700 is configured to play the video contentand graffiti input.

The handheld electronic device may implement all functions of theforegoing embodiments. Implementation principles and beneficial effectsthereof are similar, and are not described herein again.

Persons of ordinary skill in the art may understand that all or some ofthe steps of the method embodiments may be implemented by a programinstructing relevant hardware. The program may be stored in acomputer-readable storage medium. When the program runs, the steps ofthe method embodiments are performed. The foregoing storage mediumincludes: any medium that can store program code, such as a ROM, a RAM,a magnetic disk, or an optical disc.

Finally, it should be noted that the foregoing embodiments are merelyintended for describing the technical solutions of the presentinvention, but not for limiting the present invention. Although thepresent invention is described in detail with reference to the foregoingembodiments, persons of ordinary skill in the art should understand thatthey may still make modifications to the technical solutions describedin the foregoing embodiments or make equivalent replacements to some orall technical features thereof, without departing from the scope of thetechnical solutions of the embodiments of the present invention.

What is claimed is:
 1. A video SMS message sending method comprising:acquiring audio/video content, synchronously acquiring graffiti inputcorresponding to the audio/video content, and combining the audio/videocontent and the graffiti input into a graffiti video file; and sendingthe graffiti video file as a video SMS message to an SMS messagereceiver.
 2. The method according to claim 1, wherein the acquiringaudio/video content, synchronously acquiring graffiti inputcorresponding to the audio/video content, and combining the audio/videocontent and the graffiti input into a graffiti video file comprises:acquiring the audio/video content to form an audio/video file, andsynchronously acquiring the graffiti input corresponding to theaudio/video content to form a graffiti file; and separately storing theaudio/video file and the graffiti file in the graffiti video file, andrecording respective starting locations of the audio/video file and thegraffiti file.
 3. The method according to claim 2, wherein the acquiringthe audio/video content to form an audio/video file, and synchronouslyacquiring the graffiti input corresponding to the audio/video content toform a graffiti file comprises: acquiring audio/video content of acurrent frame, and recording the audio/video content in the audio/videofile; while acquiring the audio/video content of the current frame,collecting, at a transparent background layer, at least one type ofgraffiti input, combining the at least one type of graffiti input into agraffiti picture of the current frame, recording the graffiti picture ofthe current frame in the graffiti file, and recording attributeinformation of the graffiti picture of the current frame in a graffitiinformation maintenance table; and repeatedly executing the foregoingsteps of acquiring, recording, collecting, combining, recording, andrecording to process each frame, until it is determined that recordingof the audio/video content ends.
 4. The method according to claim 3,wherein the attribute information of the graffiti picture of the currentframe comprises at least: a size of the graffiti picture of the currentframe, a time point and an encoding format that are of the currentframe, and duration of the current frame.
 5. The method according toclaim 3, wherein the at least one type of graffiti input comprises atleast: touch track input and an additional picture.
 6. The methodaccording to claim 2, wherein the acquiring the audio/video content toform an audio/video file, and synchronously acquiring the graffiti inputcorresponding to the audio/video content to form a graffiti filecomprises: acquiring audio/video content of a current frame, andrecording the audio/video content in the audio/video file; whileacquiring the audio/video content of the current frame, collecting, at atransparent background layer, at least one type of graffiti input,separately recording the at least one type of graffiti input in thegraffiti file, and separately recording attribute information of the atleast one type of graffiti input in a graffiti information maintenancetable; and repeatedly executing the foregoing steps of acquiring,recording, collecting, combining, recording, and recording to processeach frame, until it is determined that recording of the audio/videocontent ends.
 7. A video SMS message receiving method comprising:receiving a video SMS message sent by an SMS message sender; andacquiring, by parsing, audio/video content and graffiti input from thevideo SMS message, and superimposing the graffiti input on theaudio/video content to perform playing.
 8. The method according to claim7, wherein the acquiring, by parsing, audio/video content and graffitiinput from the video SMS message, and superimposing the graffiti inputon the audio/video content to perform playing comprises: separatelyacquiring, at respective starting locations that are of an audio/videofile and a graffiti file and indicated in a graffiti video file of thevideo SMS message, the audio/video file and the graffiti file byparsing; acquiring, by splitting, an audio layer and a video layer fromthe audio/video file; acquiring a graffiti layer from the graffiti file;synchronizing the graffiti layer with the audio layer and the videolayer in terms of frame time according to attribute information, of agraffiti picture, recorded in a graffiti information maintenance tablein the graffiti file; and playing the audio layer by using an audiochannel, playing the video layer by using a second layer of a videochannel, playing the graffiti layer by using a first layer of the videochannel, wherein the first layer is superimposed on the second layer. 9.The method according to claim 7, wherein the acquiring, by parsing,audio/video content and graffiti input from the video SMS message, andsuperimposing the graffiti input on the audio/video content to performplaying comprises: separately acquiring, at respective startinglocations that are of an audio/video file and a graffiti file andindicated in a graffiti video file of the video SMS message, theaudio/video file and the graffiti file by parsing; acquiring, bysplitting, an audio layer and a video layer from the audio/video file;acquiring a graffiti layer from the graffiti file; synchronizing the atleast one type of graffiti input that is at the graffiti layer with theaudio layer and the video layer in terms of frame time according toattribute information, of the at least one type of graffiti input,recorded in a graffiti information maintenance table in the graffitifile; and playing the audio layer by using an audio channel, playing thevideo layer by using a second layer of a video channel, playing the atleast one type of graffiti input that is at the graffiti layer by usinga first layer of the video channel separately, wherein the first layeris superimposed on the second layer.
 10. The method according to claim9, wherein the at least one type of graffiti input comprises at least:touch track input and an additional picture.
 11. A video SMS messagesending apparatus comprising a processor and a non-transitorycomputer-readable medium storing program modules executable by theprocessor, the modules including: an acquiring and combining moduleconfigured to acquire audio/video content, synchronously acquiregraffiti input corresponding to the audio/video content, and combine theaudio/video content and the graffiti input into a graffiti video file;and a sending module configured to send the graffiti video file as avideo SMS message to an SMS message receiver.
 12. The apparatusaccording to claim 11, wherein the acquiring and combining modulecomprises: a file acquiring unit configured to acquire the audio/videocontent to form an audio/video file, and synchronously acquire thegraffiti input corresponding to the audio/video content to form agraffiti file; and a file combining unit configured to separately storethe audio/video file and the graffiti file in the graffiti video file,and record respective starting locations of the audio/video file and thegraffiti file.
 13. The apparatus according to claim 12, wherein the fileacquiring unit comprises: an audio/video file acquiring subunitconfigured to acquire audio/video content of a current frame, and recordthe audio/video content in the audio/video file; and a first graffitifile acquiring subunit configured to: while the audio/video content ofthe current frame is being acquired, collect, at a transparentbackground layer, at least one type of graffiti input, combine the atleast one type of graffiti input into a graffiti picture of the currentframe, record the graffiti picture of the current frame in the graffitifile, and record attribute information of the graffiti picture of thecurrent frame in a graffiti information maintenance table, whereinoperations of the audio/video file acquiring subunit and the firstgraffiti file acquiring subunit are repeatedly executed to process eachframe, until it is determined that recording of the audio/video contentends.
 14. The apparatus according to claim 13, wherein the attributeinformation of the graffiti picture of the current frame comprises atleast: a size of the graffiti picture of the current frame, a time pointand an encoding format that are of the current frame, and duration ofthe current frame.
 15. The apparatus according to claim 13, wherein theat least one type of graffiti input comprises at least: touch trackinput and an additional picture.
 16. The apparatus according to claim12, wherein the file acquiring unit comprises: an audio/video fileacquiring subunit configured to acquire audio/video content of a currentframe, and record the audio/video content in the audio/video file; and asecond graffiti file acquiring subunit configured to: while theaudio/video content of the current frame is being acquired, collect, ata transparent background layer, at least one type of graffiti input,separately record the at least one type of graffiti input in thegraffiti file, and separately record attribute information of the atleast one type of graffiti input in a graffiti information maintenancetable, wherein operations of the audio/video file acquiring subunit andthe second graffiti file acquiring subunit are repeatedly executed toprocess each frame until it is determined that recording of theaudio/video content ends.
 17. A video SMS message receiving apparatuscomprising a processor and a non-transitory computer-readable mediumstoring program modules executable by the processor, the modulesincluding: a receiving module configured to receive a video SMS messagesent by an SMS message sender; and a parsing and playing moduleconfigured to acquire, by parsing, audio/video content and graffitiinput from the video SMS message, and superimpose the graffiti input onthe audio/video content to perform playing.
 18. The apparatus accordingto claim 17, wherein the parsing and playing module comprises: a fileparsing unit configured to separately acquire, at respective startinglocations that are of an audio/video file and a graffiti file andindicated in a graffiti video file of the video SMS message, theaudio/video file and the graffiti file by parsing; an audio/videosplitting unit configured to acquire, by splitting, an audio layer and avideo layer from the audio/video file; a graffiti splitting unitconfigured to acquire a graffiti layer from the graffiti file; a firsttime synchronization unit configured to synchronize the graffiti layerwith the audio layer and the video layer in terms of frame timeaccording to attribute information, of a graffiti picture, recorded in agraffiti information maintenance table in the graffiti file; and a firstplaying unit configured to play the audio layer by using an audiochannel, play the video layer by using a second layer of a videochannel, and play the graffiti layer by using a first layer of the videochannel, wherein the first layer is superimposed on the second layer.19. The apparatus according to claim 17, wherein the parsing and playingmodule comprises: a file parsing unit configured to separately acquire,at respective starting locations that are of an audio/video file and agraffiti file and indicated in a graffiti video file of the video SMSmessage, the audio/video file and the graffiti file by parsing; anaudio/video splitting unit configured to acquire, by splitting, an audiolayer and a video layer from the audio/video file; a graffiti splittingunit configured to acquire a graffiti layer from the graffiti file; asecond time synchronization unit configured to synchronize the at leastone type of graffiti input that is at the graffiti layer with the audiolayer and the video layer in terms of frame time according to attributeinformation, of the at least one type of graffiti input, recorded in agraffiti information maintenance table in the graffiti file; and asecond playing unit configured to play the audio layer by using an audiochannel, play the video layer by using a second layer of a videochannel, and play the at least one type of graffiti input that is at thegraffiti layer by using a first layer of the video channel separately,wherein the first layer is superimposed on the second layer.
 20. Theapparatus according to claim 19, wherein the at least one type ofgraffiti input comprises at least: touch track input and an additionalpicture.
 21. A handheld electronic device comprising a processor and amemory, and further comprising: a video SMS message sending apparatuscomprising first program modules executable by the processor, the firstprogram modules comprising: an acquiring and combining module configuredto acquire audio/video content, synchronously acquire graffiti inputcorresponding to the audio/video content, and combine the audio/videocontent and the graffiti input into a graffiti video file; a sendingmodule configured to send the graffiti video file as a video SMS messageto an SMS message receiver; a video SMS message receiving apparatuscomprising second program modules executable by the processor, thesecond program modules comprising: a receiving module configured toreceive a video SMS message sent by an SMS message sender; and a parsingand playing module configured to acquire, by parsing, audio/videocontent and graffiti input from the video SMS message, and superimposethe graffiti input on the audio/video content to perform playing; acamera configured to collect video content; a microphone configured tocollect audio content; a touchscreen configured to receive an inputoperation of a user; an audio channel configured to play the audiocontent; and a video channel configured to play the video content andgraffiti input.